A sustainable development OCR system in CADAL application

Author

  • HUANG Chen
Abstract

This paper briefly introduces the main ideas of a sustainable development OCR system based on open architecture techniques and then describes the construction of an optical character recognition (OCR) center built on computer clusters, for the purpose of dynamically improving the recognition precision of the digitized texts of a million volumes of books produced by the China-US Million Books Digital Library (CADAL) Project. The practice of this center will provide a helpful reference for other digital library projects.

Similar resources

The Application of Machine Translation in China-America Digital Academic Library

This paper briefly introduces the main ideas of machine translation (MT) technique, then discusses the application of MT to the China-America Digital Academic Library (CADAL). Index Terms — CADAL, machine translation, digital library

Research Justice in Medical Sciences Universities: Old Concept with a New Application

Research is a key element of sustainable development, and long-term development is impossible without an integrated research system. The most important step in creating a useful and efficient research platform will be to strengthen the motivation of researchers in universities, particularly medical sciences universities, to researc...

Development of a Spatial Model for Locating Optimal Areas of Sustainable Physical Development Using Fuzzy Logic (Case Study: Hamadan City)

Today, physical development and population growth in Iranian cities, as in other developing countries, are on the rise. One of the main problems in urban areas is the lack of attention to the parameters that influence sustainable urban development. Various factors, such as natural phenomena, play a role in urban development, and the effective parameters must be considered for locatin...

Improving CADAL Portal Usability: Book Search, Reading, Reporting Services

CADAL has been open to the public for more than two years. We have received a lot of positive feedback for improving the usability of the CADAL portal. In this paper, we present our work on improving the usability of the book search, reading, and reporting services in the CADAL portal. First, we present a quick book search application whose effective hybrid ranking mechanism combines content similarities wit...

Cluster management standards in Poland in the context of sustainable development

The study relates to the problem of cluster management in the conditions of sustainable development. Against the background of the assumptions and conditions for sustainable development, the specificity of the cluster activity in the conditions of the Polish economy has been presented. The objective of the paper is to characterize the cluster management standards in Poland. Such standards have ...

Publication date: 2007